Spoken Language Recognition: From Fundamentals to Practice
نویسندگان
چکیده
منابع مشابه
From Speech Recognition to Spoken Language Understanding
Spoken language is one of the most natural, efficient, flexible, and economical means of communication among humans. As computers play an ever increasing role in our lives, it is important that we address the issue of providing a graceful human-machine interface through spoken language. In this paper, we will describe our recent efforts in moving beyond the scope of speech recognition into the ...
متن کاملTopics and Spoken Language Recognition
This paper concerns the use of discourse information in spoken dialogue systems, and argues that knowledge of the domain, modelled with the help of topic trees, is important in improving the recognition accuracy of spoken utterances. The use of discourse information in spoken language systems is usually motivated by arguments concerning the system's robustness: (1) support for speech recognitio...
متن کاملSpoken Language Recognition and Understanding
The goal of this research is to demonstrate spoken language systems in support of interactive problem solving. The system accepts continuous speech input and handles multiple speakers without explicit speaker enrollment. The MIT spoken language system combines SUMMIT, a segment-based speech recognition system, and TINA, a probabilistic natural language system, to achieve speech understanding. T...
متن کاملSpoken Word Recognition Strategy for Tamil Language
This paper outlines a strategy for recognizing a preferred vocabulary of words spoken in Tamil language. The basic philosophy is to extract the features using mel frequency cepstral coefficients (MFCC) from the spoken words that are used as representative features of the speech to create models that aid in recognition. The models chosen for the task are hidden Markov models (HMM) and autoassoci...
متن کاملSpoken language recognition based on senone posteriors
This paper explores in depth a recently proposed approach to spoken language recognition based on the estimated posteriors for a set of senones representing the phonetic space of one or more languages. A neural network (NN) is trained to estimate the posterior probabilities for the senones at a frame level. A feature vector is then derived for every sample using these posteriors. The effect of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the IEEE
سال: 2013
ISSN: 0018-9219,1558-2256
DOI: 10.1109/jproc.2012.2237151